Update openvino-mlir-gc integration #167

niuxiaog · 2024-08-27T05:09:13Z

No description provided.

dchigarev · 2024-09-09T10:18:02Z

@niuxiaog could you please briefly describe what this PR does?

niuxiaog · 2024-09-10T01:20:14Z

@niuxiaog could you please briefly describe what this PR does?

Sure. I will update the RFC, which introduces the constant tensor folding pass and the related parts in OV integration.

niuxiaog · 2024-09-10T05:30:54Z

@niuxiaog could you please briefly describe what this PR does?

DLTI information is attached to llvm module. Currently, default values are from SPR.
Support for constant tensors folding pass is added, including marking constant input tensors, parsing relevant information from llvm module optimized by GC, allocating buffers for folded tensors and calling fold and entry funcs correctly. This is an experimental implementation and will be optimized later.
Scripts for benchmarking is updated to support multi threading.
Building and linking errors are fixed to integrate latest main branch of GC.
Some experimental codes for testing, such as generating constant weights for mlp, splitting out the TransposeOp from linalg::MatmulTransposeBOp. These codes will be deleted before merging.

For a more complete introduction of constant tensors folding pass, please refer to RFC.

dchigarev · 2024-09-16T09:33:39Z

src/common/transformations/src/transformations/mlir/op/matmul.cpp

        const auto inputs = context.getInputs(node);
+        mlir::SmallVector<Value, 2> ins{inputs[0]};
+
+        if (isTransposedB) {


why do we need to split it?

This is experimental code to test that the TransposeOp can be moved to fold() by the constant tensor folding pass. It will be deleted later.

niuxiaog added 6 commits August 22, 2024 10:54

Add dlti specs

fe4ce5f

Set const_args_index to func

f7ea8df

Modify mlp_bench.sh

61fa926

Fix undefined symbol

9155584

Fix link error of brgemm

8ca1f9e

Enable set omp threads

1c7eb22

github-actions bot added category: build category: transformations category: tools no-match-files labels Aug 27, 2024

niuxiaog added 12 commits August 27, 2024 14:15

Update gc branch

239a36b

Add some tools and files

0e6e78d

Some updates

0816008

Generate model with Const weight

1473400

Mark input from Constant

a4fbb06

Add pipeline to call fold() and entry()

47934e2

Add a test: transpose B, matmul

2cd42c1

USe set to identify first execution

85d9fc6

Restore graph-compiler.cmake

4883b2e

Add a bf16 example

16cdbfe

no fold case

a2dee4b

Parse shapes to alloc buffer.

0162a0b

niuxiaog marked this pull request as ready for review September 10, 2024 05:37

Optimize relu ir.

9b84fde

dchigarev reviewed Sep 16, 2024

View reviewed changes

Correct func args

ad0451d

github-actions bot removed the no-match-files label Sep 18, 2024

niuxiaog added 2 commits September 19, 2024 14:31

Add accuracy test

01688c1

Align ExecutionEngine with benchgc

e03241b

slyalin mentioned this pull request Oct 7, 2024

GC-GPU integration #169

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update openvino-mlir-gc integration #167

Update openvino-mlir-gc integration #167

niuxiaog commented Aug 27, 2024

dchigarev commented Sep 9, 2024

niuxiaog commented Sep 10, 2024

niuxiaog commented Sep 10, 2024 •

edited

Loading

dchigarev Sep 16, 2024

niuxiaog Sep 18, 2024 •

edited

Loading

Update openvino-mlir-gc integration #167

Are you sure you want to change the base?

Update openvino-mlir-gc integration #167

Conversation

niuxiaog commented Aug 27, 2024

dchigarev commented Sep 9, 2024

niuxiaog commented Sep 10, 2024

niuxiaog commented Sep 10, 2024 • edited Loading

dchigarev Sep 16, 2024

Choose a reason for hiding this comment

niuxiaog Sep 18, 2024 • edited Loading

Choose a reason for hiding this comment

niuxiaog commented Sep 10, 2024 •

edited

Loading

niuxiaog Sep 18, 2024 •

edited

Loading